
o o cJ o 9 9 



CM 

d 



CO 



9" 

Si 




1 



ANALOG 
CONVERTING MEANS 



WAVEFORM CON- 
CATENATING MEANS 



< 

UJ 



— 



PITCH 
CONVERTING MEANS 



I 



AMPLITUDE 
CONVERTING MEANS 



I 



DURATION 
CONVERTING MEANS 



f— 



as 



• SPEECH 


UNIT 


OBTAINING 


MEANS 



I 



I 



^CHARACTER STRING 
ANALYZING MEANS 



CO 

CD 



lif CO 

< 5 

< H 

xeo 
o 



0= ^ 

oc Q 
o s 



CO 
CO 



FIG.5 



SPEECH SYNTHESIS PROGRAM 



c 



START 



:> 



INPUTTING CHARACTER 
STRING 



-S1 



MORPHOLOGICAL 
ANALYSIS 



OBTAINING ACCENT 
VALUE 



DETERMINIG DURATION 
FOR EACH SYLLABLE 



VOICED AND UNVOICED 
SOUND CLASSIFICATION 
FOR EACH PHONEME 



DETERMINING THE 
CONTOUR OF FUNDA- 
MENTAL FREQUENCY Fo 



DETERMINING THE 
CONTOUR OF 
AMPLITUDE 



OBTAINING SAMPLE 
SPEECH WAVEFORM FOR 
EACH SYLLABLE 



-82 



-S3 



-84 



.85 



-86 



-87 



-88 



6 



FIG.6 



1 



DURATION 
CONVERSION 



I 



AMPLITUDE 
CONVERSION 



-89 



-810 



PITCH 
CONVERSION 



I 



CONCATENATION OF J 
WAVEFORMS "l 


\ 


r 


OUTPUTTING SYNTHE- ^ 
SIZED SOUND WAVEFORM 




f 




812 



-813 



FIG.7 

PROGRAM FOR PITCH CONVERSION PROCESS 



START 



\ 




POINTING FIRST 
PITCH UNIT 



S21 




PONTING NEXT 
PITCH UNIT 




FIG.8 



WORD DICTIONARY 



WORD 


PART OF 
SPEECH 


READING 




SYLLABLE 


ACCENT 
VALUE 










s a 


1 




NOUN 


s a k u r a 




k u 


5 










r a 


O 






s a k u 




s a 


5 


VERB 




k u 


O 


1 
1 

j 


1 
1 
j 


1 
1 
1 
1 




1 
1 
1 
1 


1 
1 

1 




POST 
POSITIONAL 
PARTICAL OF 
JAPANESE 


g a 




g a 


2 


1 
1 
1 
1 


1 
1 
1 
1 


1 
1 

i 




I 
1 
1 


1 
1 
1 
1 



FIG.9 



DICTIONARY OF SYLLABLE DURATION 



SYLLABLE 


DURATION [ms] 


a 


1 1 O 


i 


1 1 4 


u 


90. 


! 


1 
1 


ko 


1 88 


1 
1 


1 
1 



lU 



O lO 10 O 



LU 
O 
O 
< 



E ^ 

'-' i 00 CM CM 00 CO 
O 00 O) T- h- (0 



-I 

00 

5 



CVJ ^ 



CO 



CO ^ O 00 O CM O 
Eh 00 00 O) h- 
^ CM ^ CD 



ai 
o 
z 

LU 

o o 

LU 
Z CO 

< LU 
X -J 

O CO 

fi 

> 

CO 



O 



o — 
— JO 

c o 



cd 




FIG.11 



k o Nn I c h I wa 



Fo 



FIG.12 



DICTIONARY OF VOICED / UNVOICED SOUNDS 
FOR CONSONANTS /VOWELS 



PHONEME 


INDEX 


a 


V 


i 


V 


1 
1 


1 
1 


k 


cu 


1 
1 


1 
1 


b 


cv 


1 
1 


1 
1 



FIG.13 



DICTIONARY OF SOUND SOURCE AMPLITUDE 



SYLLABLE 


SOUND SOURCE AMPLITUDE 


a 




-t 






Av 1 lOms 


1 
1 


1 
1 
1 


ko 


A. 










188 ms 




Av 




1 
1 
1 



A V : VOICED SOUND SOURCE AMPLITUDE 
A f : UNVOICED SOUND SOURCE AMPLITUDE 



FIG.14 



IMiLI^Uli 

"frrTrrr 



Ko N Do No Ni Chi 



CONTOUR OF 
SOUND SOURCE AMPLITUDE OF/ko/ 



CONTOUR OF 
FUNDAMENTAL FREQUENCY OF 



/ko/ |_ 



DURATION OF/ko/ 



1 23ms 



PITCH MARK 



ZERO CROSSING MARK 



O, 1 5. 30. •• 
1 2, 27. 42, 



o 

m 

■m 
m 
=c 

N= 
O 
G 
=P 

a 

G 




FIG.17 



SYLLABLE 
WEIGHT 


TYPE OF 
SYLLABLE 
WEIGHT 


SYLLABLE 
STRUCTURE 


EXAMPLE 


1 


LIGHT 
SYLLABLE 


\ w / I y / V 


Ka^ sa, la, na, na, ma, ya, ra,... 

a, i, u, e, o 

che, pya,.... 


2 


HEAVY 
SYLLABLE 


(C) (y)VR 
(C) (y)V J 
(C) (y)VN 
(C) (y) VQ 


to: , ya:, kyu:. pyu:.... 
kai, nou, at, ui, pyua,... 
kaN, aN, myaN, chaN,... 
chuQ, ryaQ, jaQ. hvaQ«... 


3 

OR MORE 


SUPER HEAVY 
SYLLABLE 


(C) (y)VRN 
(C) (y)VRQ 
(C) (y)VJN 
(C) (y)VJQ 
(C) (y) VNQ 

AND THE LIKE 


che:N, Ju:N, a:N, ... 
u:Q, che:Q,... 
saiN, pauN, auN, chaiN,... 
kaiQ, daiQ, kyaiO, uiQ,... 
doNQ, uNQ, chaNQ,... 



C : CONSONANT (EXCLUDING Q, Y AND N) 
y : SEMI VOWEL 

V : VOWEL (EXCLUDING R AND J) 
R : LONG VOWEL 

J : THE SECOND ELEMENT OF DIPHTHONG 
Q : GEMINATED SOUND (JAPANESE SOKUON) 
N : SYLLABIC NASAL 
(X) : SYLLABLE WEIGHT IS INDEPENDENT OF X 



CO 

z 
< 
liJ 
s 

o 



CO 

o 

S 

o 
o 



O 
u. 

LU 
> 

I 

X 

o 

LU 
LU 
Q. 

CO 



Si 



1 



ANALOG 
CONVERTING MEANS 



00 



X 



XL 



LU 

CO 
< 
00 

CO < 

o 



o 

lU 



CM 
CM- 



WAVEFORM CON- 
CATENATING MEANS 



o 

05- 



SPEECH UNIT 
WAVEFORM 
nFNFRATIMG MEANS 



CM 



" — CHARACTER STRING 
ANALYZING MEANS 



O 

1 



00 

d 



"J CO 

< 5 
Q£ cr 











NARY 1 
lOLOGI 
ALYSIS 










1/ 


9 o J 




FIG.19 




PITCH LENGTH 100ms 



T j — PITCH LENGTH 101ms 

V 

-r 1— PITCH LENGTH 102ms 





FIG.21 



S51 

J- 



SHORTENING 
SPEECH SOUND 
DURATION 








LOWE 
AMPLITU 
FUNDAr 
FREQI 


:RING 
IDE AND 
iffENTAL 
iENCY 



RAISING 
lAMPLITUOE ANDl 
FUNDAMENTAL 
FREQUENCY 



S54 



LENGTHENING 
SPEECH SOUND 
DURATION 



FIG.23 



qORIGINALq p o 

I. """" J IT 



ALLOCATING 
PITCH MARKS 




CLIPPING OUT 
WITH HANNING 
WINDOW 



OVERLAPPING 
IN COMPOSITE 
PITCH 



COMPOSITE 
WAVEFORM 



Ol 




FIG.28 



ORIGINAL SIGNALS (SOLID LINE) AND WINDOW FUNCTION (BROKEN LINE) 



Pi 



P3 



SIGNALS AFTER CONVERSION OF Fo 



P2 



